cd/entity/BlueDot Technical AI Safety Projectยท homeโ€บ entitiesโ€บ BlueDot Technical AI Safety Project
grep -l @bluedot technical ai safety project /news/*.json | wc -l โ†’ 1

@BlueDot Technical AI Safety Project

mentions 1 type Organization feed RSS
05:58
2026-05-30
lesswrong.com
ai-safety

Belief manifolds, and how to steer along them

A BlueDot Technical AI Safety Project researcher reproduced a study from Goodfire demonstrating that language model representations form curved geometric manifolds, not simple linear directions. The wโ€ฆ

// co-occurs with top 5 entities